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Abstract — The  traditional  problem  in  binary  decision  diagrams  (BDDs)  has  been  to  minimize  the  number  of  nodes  since  this  reduces 
the  memory  needed  to  store  the  BDD.  Recently,  a  new  problem  has  emerged:  minimizing  the  average  path  length  (APL).  APL  is  a 
measure  of  the  time  needed  to  evaluate  the  function  by  applying  a  sequence  of  variable  values.  It  is  of  special  significance  when  BDDs 
are  used  in  simulation  and  design  verification.  A  main  result  of  this  paper  is  that  the  APL  for  benchmark  functions  is  typically  much 
smaller  than  for  random  functions.  That  is,  for  the  set  of  all  functions,  we  show  that  the  average  APL  is  dose  to  the  maximum  path 
length,  whereas  benchmark  functions  show  a  remarkably  small  APL  Surprisingly,  however,  typical  functions  do  not  achieve  the 
absolute  maximum  APL.  We  show  that  the  parity  functions  are  unique  in  having  that  distinction.  We  show  that  the  APL  of  a  BDD  can 
vary  considerably  with  variable  ordering.  We  derive  the  APL  for  various  functions,  including  the  AND,  OR,  threshold,  Achilles'  heel,  and 
certain  arithmetic  functions.  We  show  that  the  unate  cascade  functions  uniquely  achieve  the  absolute  minimum  APL. 

Index  Terms — Binary  decision  diagrams,  BDD,  average  path  length,  APL,  worst-case  path  length. 

-  ♦  - 


1  Introduction 

considerable  research  has  been  devoted  to  the  use  of 
binary  decision  diagrams  (BDDs)  in  logic  design, 
dating  back  40  years  to  Lee's  [1]  original  paper.  Most  of 
this  research  followed  from  the  seminal  paper  by  Bryant  [2], 
who  showed  that  a  reduced  ordered  BDD  (ROBDD)  is  a 
canonical  representation  of  a  logic  function.  The  focus  of  the 
latter  paper  and  many  papers  [2],  [3],  [4],  [5]  that  followed 
has  been  on  minimizing  the  number  of  nodes.  This  is 
inspired  by  the  memory  needed  to  store  the  BDD,  which 
can  be  large  for  practical  circuits. 

However,  a  different  cost  measure  is  also  important.  For 
each  assignment  of  values  to  the  variables  of  a  function  f(X), 
there  is  a  path  from  the  root  node  to  one  of  two  terminal 
nodes,  representing  a  value  of  f(X).  Summing  this  path 
length  over  all  2"  assignments  and  dividing  by  2n  yields  the 
average  path  length  or  APL  of  f(X).  In  general,  the  APL 
depends  on  the  order  of  the  variables  and  we  denote  the 
minimum  APL  over  all  possible  orderings  as  APLj(X)-  In 
logic  simulation,  the  function  realized  is  verified  by 
applying  (typically  many)  test  vectors  [7],  [8],  [9].  A  BDD 
with  a  small  APL  can  be  evaluated  quickly  and  is  important 
in  logic  simulation  applications.  There  are  at  least  three 
papers  that  describe  the  minimization  of  the  APL  of  BDDs 
[9],  [10],  [11].  Iguchi  et  al.  [12]  compare  the  APL  for  four 
different  decision  diagrams  representing  multioutput  func¬ 
tions.  Minimizing  the  APL  of  multivalued  functions  is 
discussed  in  [13]. 
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BDDs  have  been  used  in  the  design  of  pass  transistor 
logic  circuits  [28],  [29],  [30],  [31],  which  have  the  advantage 
of  low  power  dissipation.  In  this  case,  there  is  the  prospect 
of  choosing  an  ordering  with  a  small  APL  as  this  yields 
smaller  delay.  Also,  in  a  logic  synthesis  involving  functional 
decomposition,  the  paths  in  the  BDD  representing  the 
function  create  don't  care  values  from  missing  variables.  In 
this  case,  minimizing  the  path  length  has  the  effect  of 
increasing  don't  care  values  [32],  which,  in  turn,  reduces 
circuit  complexity. 

APL  has  been  important  in  other  contexts.  For  example. 
Bell  [14]  has  considered  its  use  in  decision  trees  for  pattern 
recognition  systems.  In  this  application,  one  seeks  to  answer 
the  fewest  questions  needed,  on  the  average,  to  identify  the 
object.  APL  is  useful  in  the  analysis  of  algorithms,  where 
decision  trees  have  been  used  to  determine  lower  bounds 
on  the  complexity  of  sorting  algorithms  [15].  Applications 
also  occur  in  databases  [16].  Qin  and  Malik  [17]  consider 
APL  in  decision  trees  for  the  decoding  logic  for  micro¬ 
processor  instructions.  For  more  information  on  research 
prior  to  1982,  the  reader  is  referred  to  Moret  [18].  For 
information  on  more  recent  research,  the  reader  is  referred 
to  Murthy  [19].  We  address  the  following  question: 

Does  the  APL  of  a  BDD  of  a  logic  function  vary  significantly 
enough  with  the  variable  ordering  to  merit  a  close  examination? 

In  Fig.  1,  we  show  the  BDD  of  the  carry-out  of  a  binary 
adder.  Fig.  la  shows  the  circuit,  which  consists  of  b  full- 
adders  connected  in  a  ripple-carry  configuration.  It  is 
assumed  there  is  a  carry-in  variable,  Cjn. 

Fig.  lb  shows  the  BDD  for  this  function,  where  the 
variables  are  in  descending  order  (most  significant  bits  at 
the  top),  and  Fig.  lc  shows  the  BDD  in  which  the  variables 
are  in  ascending  order  (least  significant  bits  at  the  top).  Fig.  2 
shows  the  distribution  of  path  lengths  for  the  two  orderings 
for  a  b  =  16  bit  adder.  The  pattern  associated  with  the  most 
significant  bit  at  the  top  is  a  sawtooth  because  there  are  no 
paths  of  odd  length.  In  this  distribution,  most  paths  are 
short,  resulting  in  an  APL  of  4.0.  The  paths  associated  with 
the  least  significant  bits  at  the  top  are  long,  resulting  in  a 
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Fig.  1 .  Two  BDDs  for  the  carry-out  function  of  a  6-bit  ripple  carry  adder, 
(a)  Carry-out  cout  of  6-bit  ripple  carry  adder  circuit,  (b)  BDD  for  CoUt  with 
MSB  at  top.  (c)  BDD  for  cout  with  LSB  at  top. 


large  APL,  25.0.  Both  orderings  yield  the  same  number  of 
nodes.  Thus,  the  APL  of  the  carry-out  function  is  strongly 
dependent  on  the  variable  ordering. 

In  this  paper,  we  focus  on  reduced  ordered  binary  decision 
diagrams  (ROBDD),  as  proposed  by  Bryant  [2].  As  such,  the 
order  of  the  variables  along  all  paths  from  the  root  node  to  a 
terminal  node  is  the  same  (corresponding  to  the  O  of 
ROBDD).  Further,  each  sub  function,  as  represented  by  an 
assignment  of  values  to  variables  beginning  at  the  root  node, 
is  represented  by  a  unique  node  (corresponding  to  the  R  of 
ROBDD).  As  is  common,  we  use  BDD  to  abbreviate  ROBDD. 

In  this  paper,  we  do  not  discuss  methods  for  minimizing 
the  APL  of  BDDs.  There  are  a  number  of  approaches  for 
doing  this.  One  is  to  enumerate  all  orderings  and  to  choose 
the  one  that  produces  the  smallest  APL.  A  more  efficient 
approach  is  branch  and  bound  [11],  which  can  be  applied  to 


functions  with  up  to  25  variables.  Both  methods  find  the 
exact  minimum.  For  functions  on  more  variables,  heuristic 
methods  must  be  used.  Two  heuristics  include  a  window 
permutation  approach  [10]  and  a  dynamic  variable  reorder¬ 
ing  method  [11],  [33]. 

This  paper  is  organized  as  follows:  In  Section  2,  we 
discuss  specific  functions.  As  far  as  we  know,  this  is  the  first 
analysis  of  important  functions  with  respect  to  the  APL  of 
their  BDD.  In  Section  3,  we  analyze  the  average  of  the  APL 
for  classes  of  functions,  including  all  functions  and  all 
symmetric  functions.  We  show  that  these  averages  are  near 
the  maximum  values.  We  show,  however,  that  the  set  of 
symmetric  threshold  functions  does  not  share  this  char¬ 
acteristic.  In  Section  4,  we  show  that  benchmark  functions 
have  a  relatively  small  APL.  Section  5  discusses  conclu¬ 
sions.  The  reader  interested  in  just  the  results  has  only  to 
read  the  main  part.  For  the  reader  interested  in  the  details, 
we  have  provided  proofs  in  the  Appendix. 

2  The  APL  of  Specific  Functions 
2.1  AND/OR  Function 

The  BDD  of  an  n-variable  AND  function,  shown  in  Fig.  3a 
consists  of  a  single  path  from  the  root  node  to  the  terminal 
node  labeled  1.  For  any  node,  there  is  an  edge  to  the  terminal 
node  labeled  0.  Therefore,  we  expect  the  APL  to  be  small. 

Theorem  1. 


APLAND(n)  -  2  -  ■ 

Proof.  See  the  Appendix.  □ 

This  has  been  reported  by  Breithart  and  Gal  [20].  We  extend 
this  by  deriving  also  the  distribution  of  path  lengths  for  the 
AND  function  (see  the  Appendix). 

Since  the  BDD  of  the  OR  function  is  isomorphic  to  that  of 
the  AND  function,  APLand („)  =  APLqr^.  The  AND  and 
OR  functions  belong  to  a  special  class  of  functions. 

2.2  Unate  Cascade  Functions 

Definition  1  [21].  /  is  a  unate  cascade  function  if  f  can  be 
represented  as 

/(aii,  *2,  •  •  •  xn)  =  a:[0i(a:;<>2(-  •  •  «_!<>«- 1<))  •  •  •), 


Fig.  2.  Distributions  of  path  lengths  for  the  carry-out  function. 
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0  |  fEXOR  |  1  | 

(b) 


Fig.  3.  BDDs  for  the  (a)  AND  and  (b)  EXOR  functions. 

where  x*  is  either  a or  Xi  and  •(>,  is  either  the  OR  (V)  or  AND 
(A)  operator. 


The  AND  and  OR  functions  are  unate  cascade  functions. 
The  BDD  of  a  unate  cascade  function  has  one  edge  from 
every  node  to  one  of  the  two  terminal  nodes,  except  the 
bottom  node,  where  two  edges  are  incident  to  the  terminal 
nodes.  We  have 

Theorem  2.  The  unate  cascade  functions  are  uniquely  those 
functions  whose  BDDs  have  the  smallest  APL  (2  —  A=i) 
among  all  functions  that  depend  on  n- variables. 

Proof.  See  the  Appendix.  □ 

Having  identified  exactly  those  functions  whose  BDD  has 
the  smallest  APL,  we  now  consider  which  functions  have 
the  largest  APL. 

2.3  Parity  Functions 

Fig.  3b  shows  the  BDD  of  the  Exclusive  OR  function.  All 
paths  have  length  n,  a  result  of  the  fact  that  every  variable 
in  every  assignment  of  values  is  needed  to  determine  the 
function  value.  These  statements  also  apply  to  the  comple¬ 
ment  of  the  Exclusive  OR  function.  Collectively,  these  two 
functions  form  the  parity  functions.  Since  all  path  lengths  are 
n,  the  APL  is  n. 

Theorem  3.  The  parity  functions  are  uniquely  those  functions 
whose  BDDs  have  the  largest  APL  (n)  among  all  functions 
that  depend  on  n  variables. 

Proof.  See  the  Appendix.  □ 

Theorems  2  and  3  completely  characterize  the  functions 
with  the  smallest  and  the  largest  APL.  We  now  establish 
properties  of  functions  whose  APL  falls  between  these 
extremes,  beginning  with  functions  whose  BDDs  are  similar 
to  the  parity  functions.  From  Moret  [18],  we  have 

Definition  2.  A  variable  Xi  in  function  f(X)  is  indispensable  iff 
for  every  assignment  ~a  e  EA~l  of  values  to  X  —  {xt}, 

/(«%<= o  +  /("«) U=i/  where  B  =  (°>  !}■ 

For  example,  in  the  function 

f(x  1,X2,X3)  =  Xi  ®X-2®X3Xi, 


Fig.  4.  BDDs  for  two  comparison  functions,  (a)  Equal-to  function, 
(b)  Greater-than-or-equal-to  function. 

x\  and  x-2  are  indispensable,  but  £3  and  £4  are  not.  The 
descriptor  "indispensable"  is  due  to  the  fact  that  it  is  always 
necessary  to  specify  the  value  of  an  indispensable  variable 
in  order  to  determine  the  value  of  the  function.  From  this, 
we  can  conclude 

Theorem  4.  Let  f(X)  be  a  function  with  N  indispensable 
variables.  Then,  all  paths  in  any  BDD  of  f(X)  have  a  length  of 
at  least  N,  and  APLpX)  >  N. 

For  a  given  assignment  ~a  of  values  to  X  —  {x^, 
f{X) U=o  =  f(X) U=1.  It  follows  that  f(X)\Xi=0  =  f(X)\Xi=1 
and  f(X)  =  f(X)\Xi=0  ©  xt.  Thus,  we  can  state 

Theorem  5.  Let  f(X)  be  a  function  with  N  indispensable 
variables,  yi,  1/2,  . . . ,  and  yx.  Then, 

f{X)  =  g{X-  {2/1,222,  ...,2/v})  ©2/1  ©1/2  ©  ■  ■  •  ®Vn- 

Theorem  5  shows  that  the  functions  with  the  most 
indispensable  variables  are  the  two  parity  functions.  It 
also  shows  that  there  are  no  functions  dependent  on 
n  variables  that  have  n  —  1  dispensable  variables.  There 
are  functions  with  n  —  2  indispensable  variables,  e.g., 

/(£lj  £2,  £3)  =  £l  ©£2  ©£3£4- 

2.4  Comparison  and  Carry-Out  Functions 

We  examine  two  comparison  functions,  the  Equal-to  and 
the  Greater-than-or-equal-to  functions,  as  well  as  the 
carry  out  function.  Let  X  =  {xb-i,Xb-2,  ■  ■  ■  ,Xq}  and  Y  = 
{jjb-i,  2/6-2,  •••,  2/0}  be  the  input  variables  that  represent  a 
standard  binary  number.  That  is,  x  =  £&_i26-1  +  £f,_226-2  + 

. . .  +  £02°  and  y  =  2A-i26-1  +  t/6_2 26-2  +  . . .  +  2/o2°  are  the 
standard  binary  representations  corresponding  to  logic 
variable  sets  X  and  Y,  respectively.  Then,  the  Equal-to 
function  fx=v(X,Y)  =  1  iff  x  =  y  and  the  Greater-than-or- 
equal-to  function  fx>y{X,  Y)  =  1  iff  x  >  y.  Fig.  4  shows  the 
BDDs  for  these  functions. 

The  Equal-to  function  is  1  iff  the  pair  of  ith  significant 
bits  are  identical  for  all  i.  Thus,  its  BDD  structure  is 
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Function 

APL 

Carry-out 

4  —  3/2b 

Equal-to 

4  —  4/2b 

Greater-tlian-or-equal-to 

4^ 

1 

Cn 

to 

cr 

unaffected  by  a  reordering  of  the  pairs.  For  the  Greater- 
than-or-equal-to  function,  the  smallest  APL  occurs  when 
the  MSB  is  at  the  top,  as  is  the  case  for  the  carry-out 
function.  Fig.  4  shows  its  structure  when  the  MSB  is  at  the 
top.  We  can  state 

Theorem  6.  The  smallest  APL  in  a  BDD  for  the  Greater-than-or- 
ecjual-to  function  is  4  —  Jfc  and  is  achieved  when  variables  are 
ordered  in  ascending  significance. 

Proof.  See  the  Appendix.  □ 

Note  that  the  BDD  structure  for  the  carry-out,  as  shown  in 
Fig.  lb,  and  the  comparison  functions,  as  shown  in  Fig.  4,  are 
similar.  This  suggests  that  their  APL  is  similar.  Table  1  shows 
the  APL  for  each  of  these  functions.  It  is  a  constant  4  less  a 
function  of  6,  the  number  of  bits,  that  approaches  0  as  b 
increases.  Thus,  in  the  limit,  all  three  BDDs  have  an  APL  of  4. 
Note  that  the  (small)  difference  in  the  APL  values  is  explained 
entirely  by  the  structure  at  the  bottom  of  the  BDDs. 

2.5  Symmetric  Threshold  Functions 

A  (totally)  symmetric  function  is  unchanged  by  any  permuta¬ 
tion  of  its  input  variables.  For  example,  the  AND,  OR,  and 
parity  functions  are  symmetric.  A  symmetric  threshold 
function  is  a  symmetric  function  that  is  1  iff  t  or  more  of 
its  variables  are  1,  for  0  <  t  <  n  +  1.  For  example,  the  AND 
and  OR  functions  are  symmetric  threshold  functions,  where 
t  =  n  and  t  =  1,  respectively.  Flowever,  the  Exclusive  OR 
function  is  not  a  symmetric  threshold  function,  although  it 
is  a  symmetric  function.  There  are  n  +  2  symmetric  thresh¬ 
old  functions  on  n  variables. 

Fig.  5  shows  the  distribution  of  path  lengths  for 
17  symmetric  threshold  functions  on  33  variables.  The 


Fig.  5.  Distribution  of  path  lengths  for  33-variable  symmetric  threshold 
functions  for  t  from  33  (AND)  to  17  (Majority). 


Fig.  6.  BDD  of  the  5-variable  majority  function. 

AND  function  shown  on  the  extreme  left  corresponds  to 
t  =  33,  while  the  majority  function  (which  is  1  iff  a  majority 
of  the  variables  are  1)  shown  on  the  extreme  right 
corresponds  to  f=  17.  Approximately  one-half  of  the 
symmetric  threshold  functions  are  shown;  all  except  the 
majority  function  distributions  occur  as  pairs.  For  example, 
the  distribution  of  path  lengths  associated  with  the  AND 
function,  where  t  =  33,  also  represents  the  distribution  of 
path  lengths  for  the  OR  function,  where  t  =  1.  For  the  APL 
of  a  symmetric  threshold  function,  including  trivial  func¬ 
tions  /  =  0  and  /  =  1,  we  have 

Theorem  7. 

k  0-j) 

APLS_THRES(n,t)  =2 k-  Y.  {  j, 

3= 1 

where  k  =  min{t,  n  —  t  +  1}. 

Proof.  See  the  Appendix.  □ 

When  n  is  large,  we  can  write 

Corollary  1. 

APLS  THREs(n,t)  ~  2  min{£,  n  —  t+  1}, 
where  A(n)  ~  B(n)  means  lim^oo  =  1. 

2.6  Majority  Function 

When  n  is  odd  and  t  and  n  are  related  by  n  =  2t  —  1,  the 
symmetric  threshold  function  is  the  majority  function.  One  of 
the  two  functions  realized  by  the  circuit  labeled  FA  in  Fig.la  is 
a  3-variable  majority  function:  It  is  1  iff  two  or  three  variables 
are  1.  Fig.  6  shows  the  BDD  of  the  majority  function  on  five 
variables,  which  resembles  a  square.  We  have 
Theorem  8. 

APLMAJM  =  n-^(^+T  (1) 

where  n  is  odd. 

Proof.  See  the  Appendix.  □ 

The  second  term  contains  the  factor  fjjiY  which  is  seM±5- 
The  factorials  can  be  approximated  using  Stirling's  approx¬ 
imation,  yielding 
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Fig.  7.  BDD  of  the  7-variable  BTREE  function. 
Corollary  2. 


APL 


MAJ(n) 


(2)  Fig.  8.  Distributions  of  path  lengths  for  various  33-variable  functions. 


We  could  also  write  APLMAJ(n)  ~  n,  but  the  inclusion  of 
terms  —  + 1  in  (2)  improves  the  approximation's 

precision  for  small  values  of  n.  For  n  =  3  and  n  =  15,  the 
exact  expression  and  the  approximation  of  (2)  differ  by 
4.5  percent  and  0.4  percent,  respectively. 

Note  that,  for  a  general  symmetric  threshold  function 
whose  APL  is  approximated  in  Corollary  1,  we  assume  the 
threshold  t  stays  constant  as  n  — >  oo.  For  the  APL  of  the 
Majority  function  given  in  Corollary  2,  we  assume  that  n  = 
2t  —  1  always  holds  so  that  t  increases  as  n  increases. 

2.7  BTREE  Function 

We  now  consider  the  BTREE  logic  function  that  is 
characterized  by  the  structure  of  its  BDD.  Namely,  it  is  in 
the  form  of  a  binary  tree,  where  each  node  is  labeled  by  a 
unique  variable.  Therefore,  a  BTREE  function  on  n  variables 
has  n  nonterminal  nodes.  We  restrict  BTREEs  to  be 
balanced:  Namely,  no  path  in  a  BTREE  has  a  length  of 
more  than  1  longer  than  any  other  path.  When  n  +  1  is  a 
power  of  2,  the  BTREE  is  a  complete  balanced  tree  and  every 
path  has  length  log2(n  +1).  Fig.  7  shows  a  BDD  of  a  BTREE 
function  on  seven  variables.  The  BTREE  is  important 
because  its  maximum  path  length  is  the  smallest  among  all 
n  variable  functions.  For  a  discussion  of  the  importance  of 
the  longest  path  length  in  BDDs,  the  reader  is  referred  to  [6]. 
For  general  BTREE  functions,  we  can  state 

Theorem  9. 

(n  _|_  1)  _  2Llog2(n+l)J 

APLBTREE(n)  =  |_log2(n+  1)J  +  [log2(n+1)J  • 

Proof.  See  the  Appendix.  □ 

The  BTREE  function  is  interesting  because  of  the 
following: 

Theorem  10.  The  longest  path  length  in  a  BDD  of  an  n-variable 
function  is  bounded  below  by  [log2(n  +  1)].  The  BTREE 
function  achieves  this  lower  bound. 

2.8  Achilles’  Heel  Function 

The  Achilles'  heel  function  [22]  is  often  used  to  show  the 
effect  of  variable  order  on  the  number  of  nodes  in  a  BDD.  In 
this  section,  we  derive  the  APL  of  a  BDD  for  the  Achilles' 


heel  function  for  the  good  order,  which  produces  a  BDD 
with  n  nodes,  and  the  poor  order  which  produces  a  BDD 
with  “S+l  —  2  nodes  [2]. 

Definition  3.  The  Achilles'  heel  function  is 

f{x i,x2,...,xn)  =  XiX-2  V  X3X4  V  . . .  V xn-ixn,  (3) 

where  n  is  an  even  positive  integer. 

First,  we  consider  the  Achilles'  heel  function  using  the  good 
order  (top  to  bottom)  X\,  £2,  £3,  £4, . . . ,  £n-i,  and  xn.  For  this 
case,  the  number  of  nodes  is  n.  We  have 

Theorem  11. 

AP L AchillesGoodOrder{n)  - '  b  ’ 

Proof.  See  the  Appendix.  □ 

As  n  ■  00,  AP L AchuiesGoodOrder(n)  b.  Like  the  unate  cas¬ 
cade  and  comparison  functions,  the  APL  of  the  BDD  for  the 
Achilles'  heel  function  (with  this  order)  is  small. 

Next,  we  consider  the  APL  of  the  BDD  of  the  Achilles' 
heel  function  using  the  bad  order  (top  to  bottom)  £1,  £3,  £5, 
. . .  £n_i  £2,  £4,  £6,  .  •  • ,  and  £„. 

Theorem  12. 

AP L AchillesPoorOrder{n)  ^  T"  2  2  ^  • 

Proof.  See  the  Appendix.  □ 

In  this  case,  AP L AchmesPoar0rder(n)  is  approximately  §  for 
large  n,  in  contrast  to  a  near  constant  6  for  the  APL  of  a  BDD 
with  the  good  order  £1,  £2,  £3,  £4,  . . .  ,  £„_i,  and  xn. 

2.9  Distribution  of  Path  Lengths 

In  this  section,  we  compare  the  distributions  of  path  lengths 
for  various  functions.  These  were  produced  by  the 
generating  functions  used  in  the  Appendix  to  compute 
the  APL.  Fig.  8  shows  these  distributions  for  five  functions 
on  n  =  33  variables.  This  graph  shows  that  the  AND 
function  has  many  short  paths  compared  to  the  Majority 
and  Parity  functions,  for  example.  The  Carry-Out/Equal-to 
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and  BTREE  function  can  also  be  seen  to  have  reasonably 
short  paths.  In  order  to  show  the  detail  of  the  distributions, 
the  y  axis  is  limited  to  0.5,  truncating  the  curves  associated 
with  the  BTREE  and  Parity  functions. 

3  APL  for  Sets  of  Functions 

While  the  values  of  the  APL  in  BDDs  of  example  functions 
are  useful  in  comparing  with  the  APL  of  a  given  function, 
knowing  the  average  of  the  APL  over  sets  of  functions  is 
also  interesting.  We  ask  and  answer  the  question 

What  is  the  expected  value  of  APL  for  three  classes  of 
functions — symmetric  functions,  symmetric  threshold  functions, 
and  all  functions? 

Of  particular  interest  is  whether  the  expected  value  falls 
near  the  maximum,  such  as  the  parity  function,  or  near  the 
minimum,  such  as  the  AND  function.  We  use  AvgAPLs  to 
denote  the  average  of  the  APL  over  all  functions  belonging 
to  the  set  S  of  functions. 

3.1  Set  of  Symmetric  Functions 

It  is  known  [23]  that  the  BDD  of  any  symmetric  function 
dependent  on  n  variables  has  at  least  one  path  that  is  the 
longest  possible,  n.  We  extend  this  by  showing  that 
AvgA  P Lau Sym  ( n) ,  the  average  of  the  APL  of  BDDs  over 
all  symmetric  functions,  approaches  n  as  n  increases. 
Specifically, 

Theorem  13. 

AvgAPL AuSym{n)  =  n  -  1  -  — . 

Proof.  See  the  Appendix.  □ 

3.2  Set  of  Symmetric  Threshold  Functions 

Recall  that  a  symmetric  threshold  function  is  a  symmetric 
function  that  is  1  iff  k  or  more  of  the  variables  are  1.  With 
respect  to  A vgA  P L symThres ,  the  average  APL  over  all 
symmetric  threshold  functions,  we  have 

Theorem  14. 

7L  T  n 

A  vgA  P  L\iiSymrrhri:>  ip)  —  fffffAg y  ■ 

Proof.  See  the  Appendix.  □ 

From  this,  it  follows  that 

Corollary  3. 

Tl 

AvgAPL  AllSymTh  resijl)  ^  2  ' 

This  is  unusual.  Namely,  the  set  of  symmetric  threshold 
functions  represents  a  set  whose  AvgAPL  does  not 
approach  n  as  n  increases. 

3.3  Set  of  All  Functions 

In  the  case  of  symmetric  functions,  we  can  ignore  ordering 
since  any  ordering  yields  a  minimal  BDD.  That  is  not  the 
case  for  arbitrary  functions.  In  the  case  of  all  functions,  we 
determine  an  upper  bound  on  AvgAPLAn(n),  the  APL  for 
all  functions  on  n  variables.  Namely,  we  compute 


TABLE  2 

AvgAPLau{n)  for  3  <  n  <  16  Obtained  by  Averaging 
1,000  Samples  and  from  Theoretical  Calculations 


n 

AvgAP LAu(n,  II) 
Theoretical  (5) 

AvgAP  LA,i(n) 
Experimental 

#  Samples 

3 

2.187500 

2.031250 

t  256 

4 

3.183594 

2.947937 

f  65536 

5 

4.183578 

3.972250 

1000 

6 

5.183578 

4.963125 

1000 

7 

6.183578 

5.996500 

1000 

8 

7.183578 

7.024969 

1000 

9 

8.183578 

8.062383 

1000 

10 

9.183578 

9.089324 

1000 

11 

10.183578 

10.114357 

1000 

12 

11.183578 

11.132417 

1000 

13 

12.183578 

12.146977 

1000 

14 

13.183578 

13.157292 

1000 

15 

14.183578 

14.164699 

1000 

16 

15.183578 

15.169641 

1000 

f  Obtained  by  enumerating  all  functions. 

AvgAPLAu(n,  II),  the  APL  over  all  n- variable  functions  for 
fixed  ordering  II  of  the  variables.  We  have 

Theorem  15. 

1  71-1  1 

AvgAPL All(n)  <  AvgAPLM{nm  =  (n  -  1)  +  -  -  (4) 

i— 1 

Proof.  See  the  Appendix.  □ 

For  n  — >  oo,  we  can  write 

Corollary  4. 

AvgAPL Au(n)  <  AvgAPLAu(n,  II)  ~  n  —  1  +  0.183578.  (5) 

The  upper  bounds  expressed  in  (4)  and  (5)  can  be 
substantiated  by  comparing  them  with  statistical  data.  The 
third  column  of  Table  2  shows  the  APL  for  sets  of  functions 
on  n  variables  for  3  <  n  <  16.  For  n  =  3  and  4,  the  average 
of  the  APL  was  taken  over  all  functions.  For  each  n  in  the 
range  5  <  n  <  14,  1,000  sample  functions  were  generated. 
For  each  function,  the  APL  was  computed  by  a  heuristic 
[11]  using  variable  sifting,  yielding  a  minimal  or  near- 
minimal  APL.  Since  a  minimal  APL  is  not  guaranteed,  the 
experimental  data  in  Table  2  (third  column)  also  represents 
an  upper  bound.  However,  it  is  likely  to  be  a  close 
approximation  to  the  actual  value  of  AvgAP  LAu(n)  and 
we  have  chosen  to  label  the  column  AvgAPLAu(n).  Note 
that  the  data  indeed  shows  an  APL  close  to  n.  As  n 
increases,  AvgAPLAu{n),  as  approximated  by  1,000  sam¬ 
ples,  approaches  the  upper  bound  in  (4),  which  is  given  in 
the  second  column.  That  is,  the  second  column  of  Table  2 
shows  a  theoretical  result,  as  expressed  by  in  (4),  namely, 
the  upper  bound,  AvgAPLAu(n,  II),  derived  by  averaging 
the  APL  over  all  functions,  assuming  that,  in  all  functions, 
the  same  ordering  is  applied  to  the  variables. 
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TABLE  3 

AvgAPLaii{n,k)  for  n  =  6,  8,  and  10  Variables 


k/2n 

AvgAP La,i(n,  k ) 
n  =  6  n  =  8  n  =  10 

Number  of 
Samples 

0.0000 

0.000000 

0.000000 

0.000000 

1 

0.0625 

2.828519 

4.779448 

6.913753 

10,000 

0.1250 

3.762500 

5.760283 

7.880319 

10,000 

0.1875 

4.217962 

6.278444 

8.377202 

10,000 

0.2500 

4.536800 

6.597638 

8.683376 

10,000 

0.3125 

4.741725 

6.802536 

8.878707 

10,000 

0.3750 

4.870406 

6.931597 

9.001665 

10,000 

0.4375 

5.944869 

7.005006 

9.070214 

10,000 

0.5000 

5.968644 

7.027714 

9.091589 

10,000 

3.4  Set  of  All  Functions  with  a  Specified  Number  of 
Minterms 

Instead  of  considering  all  n- variable  functions  as  a  single  set, 
as  we  did  in  the  previous  section,  we  now  divide  this  set  into 
subsets,  according  to  the  number  true  minterms.  A  true 
minterm  of  a  function  /  is  an  assignment  of  values  to  nil 
variables  that  causes  /  to  be  1 .  For  example,  the  AND  function 
has  exactly  one  true  minterm  (x\X2  ■  ■ ■  xn  =  11 ...  1).  Func¬ 
tions  with  few  true  minterms  tend  to  have  simple  BDDs.  We 
seek  to  determine  how  the  number  of  true  minterms  affects 
the  APL.  For  most  functions,  the  number  of  assignments  of 
values  to  the  variables  that  yield  0  nearly  equals  the  number 
that  yield  1.  Flowever,  certain  important  functions,  like  the 
AND  and  OR,  have  a  large  disparity  in  these  two  numbers. 

We  investigated  this  for  functions  on  n  =  6,  8,  and 
10  variables.  For  each  value  of  n,  we  randomly  chose 
10,000  functions  in  which  the  fraction  of  assignments  that 
mapped  to  1  ranged  over  k/2n  =  1/16,  1/8,  3/16,  1/4,  5/16, 
3/8,  7/16,  and  1/2.  For  each  set  of  10,000  functions,  we 
computed  the  APL  using  the  heuristic  method  of  [11].  As  in 
the  previous  section,  the  use  of  a  heuristic  method  means  that 
the  data  represents  an  upper  bound  of  AvgAPLau(n,k)', 
however,  the  values  shown  are  likely  to  be  close  to  the  exact 
values  and  we  use  AvgAPLau(n ,  k)  to  label  the  column.  Data 
above  k/2n  =  1/2  is  similar  since  a  minimal  BDD  for  some 
function  /  can  be  obtained  from  a  minimal  BDD  for  /  by 
interchanging  the  terminal  nodes  0  and  1.  This  latter  data  is 
omitted.  Table  3  shows  the  data.  It  canbe  seen  that  the  APL  for 
functions  increases  with  the  fraction  of  assignments  mapping 
to  1  up  to  0.5.  This  coincides  with  our  intuition  that  functions 
with  small  k  are  simpler  than  functions  with  large  k  and  thus, 
their  BDDs  have  fewer  nodes  and  smaller  APL. 

4  Benchmark  Functions 

The  results  of  the  previous  section  show  that  the  average  APL 
over  all  symmetric  functions  and  over  all  functions  is  close  to 
n,  the  maximum.  This  contrasts  with  the  results  from  the 
section  on  individual  functions,  showing  that  certain  com¬ 
monly  used  functions,  like  the  AND,  OR,  and  carry-out 
functions,  have  small  APL.  Therefore,  it  is  interesting  to 
examine  benchmark  functions  since  these  attempt  to  repre¬ 
sent  functions  used  in  practical  logic  design.  We  selected  189 
benchmark  functions  from  ISCAS  85  [24]  and  the  Logic 
Synthesis  Workshop  (LGSynth  93).  Since  many  have  multiple 


Fiq.  9.  am  versus  n  for  4,352  benchmark  functions. 


Fig.  10.  versus  ^  for  4,352  benchmark  functions. 

outputs,  this  actually  represents  4,352  single-output  func¬ 
tions.  Using  the  heuristic  minimizer  [11],  we  minimized  each 
function's  APL.  Then,  we  plotted  its  APL  divided  by  n  versus 
n,  where  n  is  the  number  of  variables. 

The  results  are  shown  in  Fig.  9.  A  data  point  near  the  top 
corresponds  to  an  average  APL  close  to  n.  Note  the  general 
decline  in  APL/n  as  n  increases.  This  set  of  benchmark 
functions  includes  many  unate  cascade  functions  and  the 
lower  bound  associated  with  these  functions  is  clearly  seen. 
Contrasting  the  case  of  random  functions,  the  values  of 
APL/n  for  benchmark  functions  are  small,  especially  for 
large  n.  Thus,  most  benchmark  functions  have  quite 
different  properties  than  randomly  generated  ones.  The 
suggestion  is  that,  for  functions  found  in  practice,  the  APLs 
of  the  BDDs  are  much  smaller  than  n. 

Fig.  10  shows  this  same  data,  except,  for  the  horizontal 
axis,  instead  of  plotting  n,  we  plot  where  N  is  the 
number  of  nodes  in  the  BDD.  This  shows  that,  when  2;^n  is 
large,  the  APL  tends  to  be  large.  This  shows  that  BDDs  with 
more  nodes  tend  to  have  larger  APL.  We  have  verified  that 
the  lower  bound  line  is  approximately  defined  by  unate 
functions. 
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TABLE  4 

Summary  of  the  APL  in  BDDs  of  Various  Functions  for  Large  n 


Function 

APL 

Unate  Cascade  (e.g.  AND  and  OR) 
Parity 

Carry-out 

Symmetric  Threshold  ( n ,  t) 

Majority 

BTREE  (Balanced  Tree) 

Achilles’  heel 

9  _  l  * 

^  2n_1 
n* 

A _ !_* 

1  4  n 

2min{t,  n  —  t+l) 
n-  ,/sa 

y  7r 

log  2n 

6  -  6(f)** 

All  Functions 

All  Symmetric 

All  Symmetric  Threshold 

<n  —  0.816 
n-t-T, 

nz+n  * 
2(n+2) 

*  Exact  value. 

5  Conclusions 

Because  it  is  related  to  the  time  of  evaluation,  the  APL  of  a 
BDD  is  a  useful  metric  in  analyzing  a  BDD.  We  have  shown 
that  the  APL  for  some  functions,  such  as  the  carry-out, 
depends  strongly  on  the  ordering  of  the  variables.  That  is,  it 
can  be  as  small  as  4  or  as  large  as  n,  for  large  n.  We  have,  for 
the  first  time,  contrasted  and  compared  the  APL  for  a  set  of 
common  functions,  including  AND,  OR,  Exclusive  OR, 
majority,  unate  cascade,  and  Greater-than-or-equal-to  func¬ 
tions.  Included  in  this  set  are  the  worst  as  well  as  the  best 
APL  over  all  n  variable  functions.  Our  approach  has  been  to 
compute  a  distribution  using  generating  functions  from 
which  the  APL  is  derived.  Distributions  of  path  lengths  also 
exhibit  variability  from  wide,  such  as  the  majority  function, 
to  narrow,  as  with  the  parity  function.  Table  4  summarizes 
our  results  on  APL. 

We  have  also  shown  that  the  average  of  the  APL  over 
symmetric  functions  and  over  all  functions  is  large,  approach¬ 
ing  n,  the  maximum  as  n  increases.  It  is  interesting  that  the 
average  APL  over  all  functions  does  not  approach  n  exactly: 
rather,  n  less  a  constant.  We  have  shown  that,  among  all 
n-variable  functions,  the  APL  of  the  parity  function  is 
uniquely  the  function  with  maximum  APL,  n.  We  have 
shown  empirical  evidence  that  suggests  the  actual  average 
approaches  n,  as  n  increases.  In  the  case  of  symmetric 
threshold  functions,  this  average  approaches  For  bench¬ 
mark  functions,  the  APLs  are  usually  much  less  than  n. 

A  topic  only  briefly  discussed  in  this  paper  is  the 
correlation  between  minimum  APL  and  the  minimum 
number  of  nodes.  This  has  been  discussed  in  papers  on 
minimization  algorithms  for  APL  [10],  [11],  where  it  has 
been  observed  that,  for  many  benchmark  functions,  the 
BDD  with  the  minimum  APL  does  not  correspond  to  the 
BDD  with  minimum  node  count.  There  is  an  approximate 
correlation.  For  example,  [27]  shows  that  the  APL  of  a  BDD 
is  the  sum  of  the  node  traversing  probabilities  and  it  follows 
that  reducing  the  node  count  tends  to  reduce  the  APL. 

Appendix 

Theorem  1. 

APLAND(n)  =  2  -  . 


Proof.  We  begin  by  deriving  the  generating  function  [25]  for 
the  distribution  of  path  lengths  in  the  BDD  of  the  AND 
function.  For  the  AND  function  on  n  variables,  there  are 
2n~1  assignments  of  values  for  which  the  path  length  is  1 
(aii  =  0  and  two  choices  for  each  of  the  remaining  n  —  1 
variables).  This  contributes  2n~1z1  to  the  generating 
function.  Similarly,  there  are  2n~2  assignments  of  values 
to  variables  that  have  path  length  2  and  these  contribute 
2 n~2z2  to  the  generating  function.  The  other  terms  are 
generated  in  a  similar  manner.  Therefore,  for  the 
n-variable  AND,  the  generating  function  for  the  path 
length  distribution  is 

2n~1z  +  2  n~2z2  +  •  •  •  +  2  2zn~2  +  21zn~1 

2n  _ 

+  2  •  2  V  =  - +  zn  -  2n. 

Dividing  this  by  2n  yields  Gand(u)  (-)/  a  generating 
function  for  the  fraction  of  paths  of  length  1, 2, 3, ... . 

Gand(u){z)  =  1  +  g)V  1.  (6) 

The  APL  for  the  AND  function,  APLand („)  is 
calculated  by  summing  the  path  lengths  and  dividing 
by  2n.  This  can  be  done  by  differentiating  GAND(n){z) 
with  respect  to  2  (forming  a  weighted  sum),  multiplying 
by  z,  and  setting  2=1.  Performing  these  steps  on  (6) 
yields  the  theorem.  □ 

Theorem  2.  The  unate  cascade  functions  are  uniquely  those 
functions  whose  BDDs  have  the  smallest  (2  —  A-j)  APL 
among  all  functions  that  depend  on  n  variables. 

Proof.  It  is  clear  that  /( X)  is  a  unate  cascade  function  iff  it 
has  a  BDD  in  which  all  internal  nodes  have  one  edge 
going  to  a  terminal  constant  node.  It  remains  to  show 
that  this  BDD  has  the  smallest  APL.  The  proof  proceeds 
by  induction  on  n.  For  n  =  1,  a  1-variable  unate  function 
has  a  BDD  whose  APL  is  1,  which  is  the  smallest  of  all 
1-variable  functions. 

Assume  the  theorem  holds  for  n  =  m  —  1.  A  BDD  for 
an  m-variable  function  with  the  smallest  APL  has  a 
structure  in  which  the  root  node  connects  to  a  constant 
node  by  a  single  edge  and  the  other  connects  to  a  BDD 
whose  APL  is  the  smallest  for  functions  of  m  —  1  variables. 
The  APL  is  a(m)  =  |  +  ^a(m  —  1),  where  a(m  —  1)  is  the 
smallest  APL  among  m  —  1-variable  functions.  Solving 
this  recurrence  relation  with  the  initial  conditions  a(l)  = 
1  yields  a(m)  =  2  —  Det-  LI 

Theorem  3.  The  parity  functions  are  uniquely  those  functions 
whose  BDDs  have  the  largest  (n)  APL  among  all  functions 
that  depend  on  n  variables. 

Proof.  In  the  BDD  for  one  parity  function,  the  EXOR,  all 
paths  have  length  n.  This  can  be  seen  from  Fig.  3,  or  it 
can  be  inferred  from  the  fact  that  every  variable  is 
indispensable,  i.e.,  every  variable  value  is  needed  to 
determine  the  function  value,  and  all  paths  have  an  arc 
labelled  by  every  variable.  Thus,  APLEXoR(n)  =  n.  A 
similar  statement  is  true  of  the  other  parity  function,  the 
EXNOR,  which  is  the  complement  of  the  EXOR. 
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Consider  any  function  /  other  than  a  parity  function. 
It  follows  that  there  are  two  assignments  A±  and  A2  of 
values  to  the  variables  that  differ  in  exactly  one  variable, 
say  Xj,  such  that  the  function  has  the  same  value  for  both 
assignments.  Consider  a  BDD  with  Xj  at  the  bottom.  It 
follows  that  a  path  from  the  root  node  to  a  terminal  node 
corresponding  to  either  assignment  A\  or  A2  has  n  —  1  or 
fewer  edges  since  Xj  need  not  be  evaluated.  Since  at  least 
one  path  has  length  n  —  1  or  less,  the  APL  of  /  is  strictly 
less  than  n.  □ 

Theorem  6.  The  smallest  APL  in  a  BDD  for  the  Greater-than-or- 
ecjual-to  function  is  4  —  -}}  and  is  achieved  when  imriables  are 
ordered  in  descending  significance. 

Proof.  Let  X  =  {xb~i,Xb~2,  ■  •  • ,  Xq}  and 

Y  =  {2/6-1, 2/6-2,  —  ,  S/o} 

be  the  input  variables  of  the  Greater-than-or-equal-to 
function.  Let  x  =  Xb~i2b~1  +  Xb~ 22b~2  +  . . .  +  xo2°  and  y  = 
yb- 126-1  +  2ft_226-2  +  . . .  +  j/o2°  be  the  standard  binary 
representations  of  the  X  and  Y  variables,  respectively. 
Then,  the  Greater-than-or-equal-to  function  fx>v( X,  Y)  = 
1  iff  m  >  y. 

The  proof  proceeds  by  induction  on  b,  where 
b  =  |A|  =  |y|.  For  6=1,  the  Greater-than-or-equal-to 
function  is  fx>v  =  xo  V  yo,  which  is  a  unate  cascade 
function,  such  that  APL  =  |.  Since  4  —  =  |,  for  6=1, 

the  theorem  holds  for  6=1.  Assume  it  holds  for 
fx'>1/(X\  Yr),  where  \X'\  =  \Y'\  =  b  —  1,  and  consider 
the  case  of  fx>y(X,  Y),  where  |X|  =  |y|  =  b.  The  proof 
proceeds  by  showing  that  the  minimum  APL  occurs 
when  the  two  most  significant  bits,  Xb-i  and  yb-i,  are  the 
top  two  variables  in  the  BDD.  This  yields  a  BDD 
containing  the  BDD  of  a  6—1  bit  Greater-than-or- 
equal-to  function.  In  turn,  its  minimum  APL  is  achieved 
when  the  two  most  significant  bits,  Xb- 2  and  yb- 2,  are  at 
the  top,  etc.  The  theorem  follows  from  this. 

For  all  orderings  of  variables  in  the  BDD  of  the 
Greater-than-or-equal-to  function,  there  is  no  edge  from 
the  root  node  to  a  terminal  node  since  the  determination 
of  whether  x  >  y  or  not  requires  at  least  two  variables. 
Therefore,  for  all  orderings  of  the  BDD  of  the  Greater- 
than-or-equal-to  function,  all  paths  have  length  at  least  2 
and  the  BDD  has  the  structure  shown  in  Fig.  11.  Here, 
21,22  G  {xb-i,Xb~2,  ■  ■  ■  ,Xq,  2/6—1, 2/6—2,  ■  •  • ,  2/o }  and  z\  /  22- 

In  this  structure,  the  four  triangles  represent  the  BDDs 
of  subfunctions  of  the  Greater-than-or-equal-to  function 
corresponding  to  z\ z2  =  00,  01,  10,  11.  We  consider  three 
cases:  1)  {zi,  z2}  =  {xb-i,  2/6-1 },  2)  {zi,  z2}  =  {xk,  yk}, 
k  f  b  —  1,  and  3)  all  other  choices  for  z\  and  z2-  In 
Case  1,  Bq\  and  Bw  are  null;  the  edges  going  to  their  root 
nodes  go  to  terminal  nodes,  0  and  1,  respectively. 
Further,  B00  and  Bn  are  identical  and  represent  the 
BDD  of  a  Greater-than-or-equal-to  function  with  6—1  bit 
pairs.  By  induction,  an  optimum  ordering  of  that  BDD 
yields  an  APL  of  4  —  .  Thus,  the  APL  of  the  BDD  with 

{21,22}  =  {2:6-1, 1/6- 1}  is 


12  +  1|2  +  4_26_i 


5  ^  —  4  —  A 
2b 


(7) 


Fig.  1 1 .  BDD  Structure  of  the  Comparison  Function. 


In  Case  2,  B00  and  Bn  are  BDDs  for  Greater-than-or- 
equal-to  functions  on  6—1  variables.  B0i  and  Bw 
represent  BDDs  that  depend  on  only  variables  of  higher 
significance  than  i.  The  derivation  of  the  APL,  for  this 
case,  is  similar  to  that  of  Case  1  and  yields  a  value  greater 
than  that  of  Case  1. 

Now,  consider  Case  3,  where  {21,22}  f  {2^,2/;}-  We 
show  that,  in  Fig.  11,  all  sub-BDDs,  Boa,  Bm,  Bio,  and  Bn 
depend  on  all  26  —  2  variables.  Thus,  a  lower  bound  on 
the  APL  for  this  case  is 


2  +  -LB(n  -  2)  +  -LB(n  -  2)  +  -LB(n  -  2) 

}  4  4  (8) 
+  ^LB(n  -  2)  =  2  +  LB(n  -  2), 

where  LB(n  —  2)  is  a  lower  bound  on  the  APL  of  a 
function  of  n  —  2  variables.  From  Theorem  2,  this  is 
2  —  2^3 .  Thus,  the  APL  for  this  case  is 


4- 


1 

226=3’ 


(9) 


which  is  also  larger  than  the  APL  for  Case  1.  Therefore, 
the  smallest  APL  occurs  with  Xb-i  and  yj_ \  as  the  top  two 
variables. 

We  now  complete  the  proof  by  showing  that  each  B0 0, 
Bqi,  Bio,  and  Bn  realizes  a  function  dependent  on  n  — 
2  =  2(6  —  1)  variables.  We  do  it  by  contradiction.  That  is, 
on  the  contrary,  suppose  that  some  variable,  say  Xi  (?/,)  is 
not  tested  in  Bj  e  {B00,  B0i,  Bio,  Bn}-  Then,  consider  an 
assignment  of  values  such  that  yt  =  1  ( Xi  =  0)  and  all 
other  values  are  0(1).  Then,  whether  x  >  y  depends  on  Xi 
(yf),  the  missing  variable.  It  follows  that  all  variables  are 
tested  in  Bj.  □ 


Theorem  7. 


k  fay) 

APLS_THRES(n,t)  =  2 k~  ^  J.  j ,  (10) 

3= 1 

where  k  =  min{t,  n  —  t  +  1}. 

Proof.  The  BDD  of  a  symmetric  threshold  function  is  a 
rectangle  of  t  x  (n  —  t  +  1)  nodes,  one  corner  of  which 
serves  as  the  root  node  and  the  opposite  comer  is  just 
above  the  two  terminal  nodes.  For  example,  Fig.  3a 
shows  the  case  of  an  n  x  1  rectangle,  representing  a 
symmetric  threshold  function  where  t  =  n  (AND  func¬ 
tion).  Also,  Fig.  6  shows  the  case  of  a  3x3  rectangle 
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representing  a  symmetric  threshold  function  with  t  = 
n^[  for  n  =  5  (majority  function).  There  are  n  —  t  +  1  arcs 
emerging  from  the  right  side  of  the  rectangle  going  to 
nonterminal  node  1.  And,  there  are  t  arcs  emerging  from 
the  left  side  of  the  rectangle  going  to  nonterminal  node  0. 
Since  all  paths  from  the  root  node  to  a  nonterminal  node 
pass  through  one  of  these  arcs,  we  can  calculate  the  APL 
by  summing  the  fraction  of  paths  associated  with  each 
arc  multiplied  by  the  path  length  associated  with  that 
arc.  Doing  this  yields  the  generating  function  for  the 
distribution  of  path  lengths  for  the  symmetric  threshold 
function  with  threshold  t  as 


Gs_THRES(n,t)  {%) 


E 

j=k 


J~1)  +  (3~1 
k  —  1  /  \n  —  k 


where  k  =  rnin{f. n  —  t  +  1}. 

Differentiating  Gs_THRES(n,t){z )  with  respect  to  z, 
multiplying  by  z,  and  setting  2  to  1  yields  a  weighted 
sum  which,  divided  by  2n,  yields  the  APL  given  in  (10). □ 

Theorem  8. 


APLMAJ(n)  -  n - y  +  (11) 

where  n  is  odd. 

Proof.  The  BDD  of  the  majority  function  is  a  square  of  nodes 
with  one  corner  at  the  root  node  and  the  other  corner  just 
above  the  two  terminal  nodes.  Fig.  6  shows  the  BDD  of 
the  five  variable  majority  function.  There  are  t+  1 
different  path  lengths.  The  shortest  occurs  when  the 
first  t  variables  are  all  1.  In  this  case,  the  majority 
function  is  1,  regardless  of  the  values  of  the  remaining 
t—  1  variables.  There  are  2t_1  assignments  to  these 
variables,  each  corresponding  to  a  path  of  length  t.  Thus, 
the  contribution  to  the  path  length  distribution  is 
expressed  as  2*~ 1zt.  The  next  shortest  paths  correspond 
to  assignments  of  values  to  the  variables  in  which  there 
are  t  Is  and  one  0.  This  yields  a  contribution  to  the 
distribution  of  2 f-2(*)2t+1.  The  contribution  associated 
with  the  path  length  assignments  in  which  there  are  t  Is 
and  two  0s  is  2 t+1  zt+2 ,  etc.  Thus,  the  generating 
function  for  the  path  length  distribution  is 


2 1  2t~1zt  +  2t~2 


zt+ 2  +  . . . 


j+i  2*— 3 


/2 1  -  2\ 

U- 1  ) 


t  +  1 
2 

2t—l 


The  factor  of  2  occurs  because,  for  every  path  from  the 
root  node  to  terminal  node  1  associated  with  t  Is  and  i  0s, 
there  is  a  path  from  the  root  node  to  terminal  node  0 
associated  with  t  0s  and  i  Is.  Rearranging  this  expression 
yields 


GMAJ(n)(z)  =  +  (12) 

where  n  =  2t  —  1,  for  t  =  1,2, _ 

We  compute  APLMAj(n)  by  summing  the  path  lengths 
over  all  assignments  of  values  to  the  variables  and 
dividing  by  2n ,  the  number  of  assignments.  If  all  path 


lengths  are  the  maximum  value,  n,  the  sum  of  path 
lengths  over  all  assignments  is  n 2n.  Thus, 


APL 


MAJ{ri) 


n2n  -  R(n) 


(13) 


where  R(n)  is  a  reduction  in  the  maximum  weighted 
sum  caused  by  paths  that  do  not  reach  the  maximum 
length.  In  the  case  of  the  majority  function,  all  such  paths 
extend  from  some  lower-middle  node  to  a  terminal  node. 
For  example,  if  the  first  (top)  m+  1  variables  are  1,  where 
m  =  2=l,  then  the  majority  function  is  1  regardless  of  the 
value  of  the  remaining  m  variables,  which  can  be  chosen 
in  any  of  2m  ways.  In  the  BDD  of  the  majority  function, 
this  corresponds  to  2m  paths,  each  contributing  a 
reduction  of  m  to  R(n),  for  a  total  reduction  of  2 mm.  In 
the  case  where  the  first  m  +  2  variables  have  m  +  1  Is 
and  one  0,  the  majority  function  is  1  regardless  of  the 
value  of  the  remaining  m  —  1  variables,  which  can  be 
chosen  in  2m~1  ways.  In  the  BDD  of  the  majority 
function,  this  corresponds  to  2m_1  paths,  each  contribut¬ 
ing  a  reduction  of  m  —  1  to  R(n).  Since  there  are  (mjF1) 
ways  to  choose  where  the  0  occurs,  the  total  contribution 
to  the  reduction  R(n)  in  this  case  is  (mjfl)2 m_1(m  —  1).  In 
general,  where  the  first  m  +  1  +  i  variables  have  exactly 
i  0s,  the  contribution  to  R(n)  is  (™+!)2 m~l(m  —  i )  and  we 
can  write 


R(n)=2'£(m^t)‘2m-i(m-i)i  (14) 

i=o  V  1  / 

where  m  =  .  The  factor  2  occurs  because  there  are 

corresponding  paths  that  go  to  terminal  node  0. 

From  Riordan  [26],  the  sum  in  (14)  represents  the 
probability  distribution  associated  with  Banach's  match¬ 
box  problem,  which  has  a  closed  form  solution.  Namely, 


E 


m  +  i 


2m~i(rn  —  i)  =  (m+  1) 


2  to  +  1 
m 


2~lm  -  1. 


Combining  (13),  (14),  and  (15)  yields  (11). 


(15) 

□ 


Theorem  9. 


(n  +  i)  _  2Llog2(n+i)J 

APLmREE[n)  =  [log2 (n  +  1)J  +  -  Llog2(n+1)J - •  (16) 

Proof.  In  the  BDD  of  the  BTREE  function  on  n  variables, 
when  n  =  2aG)  —  \  for  a(n)  =  1,2,...,  all  2n  assign¬ 
ments  of  values  to  the  variables  correspond  to  a  path 
of  length  a(n)  =  [log2nJ.  For  other  values  of  n, 
namely,  for  2“^  —  1  <  n  <  2aG)+1  —  l,  there  are  (2aG)  — 
^)22“+£-i-a(n)  assignments  of  values  to  the  variables  that 
correspond  to  a  path  of  length  a(n)  and  E22“+e-1-°‘G)  to 
a  path  of  length  a(n)  +  1.  From  this,  the  generating 
function  for  the  distribution  of  path  lengths  in  the  BTREE 
function  is  given  as 
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Gbtbe^z)  =  [1  -  in  -  2^"J  + 
+  [n-  2Llog2nJ  +  1]2 


-L]Og2»JzLlog2«J+1 


(17) 


APL 


Achilles  Poor  Or  der{ri) 


Differentiating  GBTREE(n)(z)  with  respect  to  z,  multi¬ 
plying  by  z,  and  setting  z  to  1,  yields  a  weighted  sum 
which  divided  by  2n  yields  the  APL  given  in  (16).  □ 


Theorem  11. 


APL 


AchillesGoodOrder{n ) 


=  6-6 


2 


Proof.  The  BDD  for  the  Achilles'  Heel  function  in  the  good 
order  is  a  cascade  of  sections  each  containing  a  BDD  of 
the  two  variable  AND.  A(Z)  =  1  z2  +  \z  is  the  distribu¬ 
tion  of  paths  from  the  root  node  of  this  section  to  the 
logic  0  side.  The  generating  function  for  the  distribution 
of  paths  from  the  root  node  of  the  Achilles'  Heel  good 
order  BDD  is  just  A(z)1,  as  each  A(z)  enumerates  the 
path  through  the  corresponding  section.  Similarly, 
\z2  +  \z2A(z)  +  \z2A(z)2  +  . . .  +  \z2A(zfi~1,  where  \z 2 
expresses  the  path  probability  beginning  from  the  root 
node  of  a  section  and  the  various  A(z)*  terms  represent 
the  ways  to  go  to  a  root  of  a  section.  Adding  the  two 
generating  functions  yields  the  generating  function  for 
the  distribution  of  path  lengths  for  both  the  0  and  1 
nonterminal  nodes 


1  l_flr2  +  l  w/2 

r  —  1  ~2  ^  ‘i  ' 

’ g*  AchillesGoodOrder{n)\Z ■)  ^ 

4  L  —  2  +2%) 

(\  ,  1  \n/2 

+  [S  +  2z] 


(18) 


We  derive  the  APL  by  differentiating 


G AchillesGoodOrder(n)  (  z) 

with  respect  to  z,  multiplying  by  z,  and  substituting 
1  — >  z.  Doing  this  yields  the  theorem.  □ 


Theorem  12. 

Tl 

AP  L  AchillesPoorOrder(n)  TJ  “  - 

Proof.  The  BDD  for  the  order  x\,  £3,  £5,  ...  xn-i  x4,  x4, 
. . . ,  and  xn  consists  of  a  complete  balanced  tree  labeled 
by  x\,  £3,  £5,  ...  £n_i  in  which  all  paths  have  lengths 
Each  leaf  in  this  tree  is  the  root  node  of  a  different 
sub-BDD  of  a  function  that  is  the  OR.  For  example,  in 
the  case  where  £1  =  £3  =  £5  =  . . .  =  £n_i  =  1,  the  sub- 
BDD  is  for  the  OR  function  £2  V  £4  V  £6  V  ...  V  xn.  The 
APL  for  paths  beginning  with  £1  =  £3  =  £5  =  . . .  = 
£n_i  =  1  is  S  +  2  — ,)  since  2  — is  the  APL  for  an 
|-variable  OR  function.  For  other  values  of  £1,  £3,  ..., 
and  xn-\,  the  sub-BDD  is  that  of  an  OR  function  on 
a  variables,  where  a  is  the  number  of  Is  among  £1,  £3, 
. . . ,  and  £t1_i  .  It  follows  that 


Distributing  the  sum  over  the  two  terms  in  (2  —  and 
expressing  each  in  closed  form  yields  the  theorem.  □ 

Theorem  13. 


AvgAPLAusym(n)  =  n  -  1  -  (19) 

Proof.  A  symmetric  function,  f(x4,  £2, ...,  xn)  has  the 
Shannon  decomposition,  /  =  £1/0  V  £1/1,  where  fo  = 
/|  0  and  /1  =  f\xi=v  From  this  realization,  we  have 

AvgAPLAUSym (n)  =  ^(1  +  AvgAPLAusym{n  -  1)) 

+  9(1  +  AvgAPLAusym{n  —  1))  —  ^yr- 

The  factor  of  )  corresponds  to  the  fact  that  one-half  of 
the  paths  begin  in  0  and  one-half  in  1.  The  1  +  term 
corresponds  to  the  edge  from  the  root  node  to  the  BDDs 
realizing  f0  and  /j.  The  term  —  jsr  corresponds  to  a 
deduction  associated  with  choosing  both  f0  and  fi  as  a 
constant  0  or  1  function.  These  contribute  ,2f , ,  but  they 
should  contribute  0.  Solving  this  recurrence  relation 
yields  (19).  □ 

Theorem  14. 


AvgAP  L  AUSymTh  res  (^) 


n2  +  n 
2(n  +  2)  ’ 


Proof.  There  are  n  +  2  symmetric  threshold  functions  on 
n  variables,  including  the  constant  0  and  1  functions, 
corresponding  to  thresholds  0  and  n  +  1,  respectively.  If 
all  paths  in  each  have  maximum  length,  then  the 
weighted  sum  over  all  paths  is  2 nn  as  there  are  2"  paths, 
each  of  length  n.  However,  many  paths  are  truncated 
and  the  weighted  sum  is  actually  2 "n(n  +  2)  —  a(n), 
where  a{n)  is  a  reduction.  The  reduction  occurs  because 
paths  are  truncated  when  enough  is  known  to  comple¬ 
tely  determine  that  the  threshold  is  exceeded  or  not 
exceeded  regardless  of  the  remaining  variable  values. 
For  the  two  extreme  thresholds,  the  function  is  a 
constant  0  or  1  and  the  reduction  is  2 nn,  completely 
eliminating  the  BDD.  This  explains  the  2n+1n  term  in 


n—  1  n—l—k 

a(n)  =  2n+1n  +  2^2kk  ^ 
k—0  j—0 


n  —  1  —  k 

j 


(20) 


In  (20),  the  sum  over  k  counts  the  reduction  when  the 
threshold  is  not  exceeded  and,  thus,  the  factor  of  2 
accommodates  the  case  when  the  threshold  is  exceeded 
as  well  since  the  two  cases  are  symmetrical.  The  sum 
over  k  enumerates  the  cases  where  there  are  k 
variables  below  the  present  level  and  none  are  needed 
to  determine  the  function  value  (it  is  0,  regardless  of 
these  variable  values).  There  are  2k  ways  to  choose 
these  values  and  each  choice  reduces  the  path  length 
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by  k.  The  sum  over  j  enumerates  the  ways  Os  can  be 
chosen  for  variables  in  the  top  of  the  BDD  such  that 
the  threshold  is  not  exceeded,  j  =  0  corresponds  to  the 
threshold  t  =  k  + 1.  In  this  case,  there  is  = 

("o  0  =  1  way  to  choose  none  among  the  n  —  1  —  k  = 
n  —  t  top  variables  to  be  1.  One  more  0  results  in  n  — 
t  +  1  Os  among  the  top  variables  and  guarantees  the 
function  is  0.  j  =  1  corresponds  to  the  threshold 
t  =  k  +  2.  In  this  case,  there  are  ("“J-*')  =  (n_{+1)  = 
n  —  t+  1  ways  to  choose  one  among  the  top  n  —  t  +  1 
variables  to  be  1.  One  more  0  results  in  n  —  t+1  Os 
among  the  top  variables  and  guarantees  the  function  is 
0.  Similarly,  this  argument  applies  to  values  of  j  up  to 
n  —  1  —  k. 

The  summation  over  j  is  just  2n~1~k.  Substituting  into 
(20)  and  rearranging  yields 

n—  1 

a(n)  =  2n+1n  +  2n^i.  (21) 

z=0 

The  summation  is  n<~n~1^ .  Substituting  yields 

a(n)  =  (n2  +  3n)2"_1.  (22) 

We  have 

.  ArtT.  ,  .  2nn(n  +  2)  —  a(n)  .  . 

AvgAPL  /\  USyrnT  hres  (jf  =  2n(n  +  9)  ’  (^) 

Substituting  (22)  into  (23)  yields  the  theorem.  □ 

Theorem  15. 

1  n_1  1 

AvgAPLAu(n )  <  AvgAPLAU(n,  II)  =  (n  -  1)  +  -  -  ^  ^ . 

i— 1 

(24) 

Proof.  A  general  function,  f(x i,X2,  —,xn)  has  the  Shannon 
decomposition,  /  =  x±fo  V  x\f\,  where  fo  =  f  |  0  and 
fi  =  f  |  j.  From  this  realization,  we  have 

AvgAPLAU(n ,  II)  =  ^  (1  +  AvgAPLAU(n  -  1,  II)) 

+  \(1  +  AvgAP LAU (n  -  1,  II))  -  . 

Here,  the  r  term  is  a  deduction  for  the  case  where  both 
subfunctions  of  the  Shannon  decomposition  are  the 
same.  For  each  such  subfunction,  the  root  node  and  the 
two  arcs  incident  to  it  are  redundant  and  the  APL  is  one 
more  than  it  should  be.  Solving  this  recurrence  relation 
yields  (24).  □ 
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